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O ! Abstract 
(N 

^ , The article presents a study on the biobjective inventory routing problem. Contrary to most 

• previous research, the problem is treated as a true multi-objective optimization problem, with the goal 

of identifying Pareto-optimal solutions. Due to the hardness of the problem at hand, a reference point 
based optimization approach is presented and implemented into an optimization and decision support 
system, which allows for the computation of a true subset of the optimal outcomes. Experimental 
investigation involving local search metaheuristics are conducted on benchmark data, and numerical 
I results are reported and analyzed. 

1 Introduction 

> 

Q>^ I The Inventory Routing Problems (IRP) is a true multi-objective optimization problem, in which decisions 

O ' from inventory management intersect with the classical vehicle routing problem (VRP). In its context, 

delivery quantity decisions need to be made, which then lead to a series of multi-period vehicle routing 
^ I problems. Several different capacity constraints, e. g. at the customers' and of the trucks, influence the 

choice of the delivery quantities. Consequently, the two areas overlap to a considerable degree, and 
thus the solution of the IRP is not only J\fV-haid but also challenging when 'only' trying to find good 
solutions by means of heuristic approaches. 
^ , A number of articles on the IRP has appeared in the past. While first approaches can be traced 

^ ' back to the 1980s 11113, a considerable growth becomes apparent in past years, with a recent survey 

given in 121 . Often, metaheuristics are used to solve the IRP. Some prominent examples are Iterated 
Local Search flOl . Variable Neighborhood Search fT3ll . Greedy Randomized Adaptive Search 13, and 
Memetic Algorithms f3 |. 

Common to most previous work on the problem is the combination of the two aspects, i. e. the 
minimization of the inventory levels and the routing costs, into an overall cost function. This is possible 
if such partial cost functions can be formulated. Then however, the tradeoff between the two aspects is 
not delivered in those mono-criterion models. When planning on a tactical level however, such issues 
may quickly arise. Here, the impact of changes in one partial cost-function on the other becomes relevant. 
Practical examples include changes in fuel prices, the orientation towards 'green' logistics, and similar 
considerations. We believe that multi-objective models may contribute to the better understanding of the 
IRP with respect to such tactical planning situations, and consequently present a study on this topic. 

In the following Section [2l we briefly introduce the investigated IRP. The methodological approach 
used to solve the problem is then described in Section[3l Experiments and results on available benchmark 
data follow in SectionlH and conclusions are presented in Section|5j 
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2 Description of the investigated IRP 

In line with our formulation described in [Si, we consider an IRP in which a given set of n customers 
needs to be delivered with goods from a single depot. With respect to the vehicle routing part of our 
IRP, the characteristics of the classical capacitated vehicle routing problem (CVRP) are considered. This 
means that we assume a unlimited number of available trucks, each of which has a limited capacity C 
for the delivered goods. 

Decision variables of the problem are on the one hand delivery quantities qn for each customer i, 
i = 1, . . . , n, and each period t,t = 1, . . . , T. On the other hand, a VRP must be solved for each period 
t of the planning horizon T, combining the delivery quantities qn into tours/routes for the involved fleet 
of vehicles. We assume qn > Vi,t, thus forbidding the pickup of goods at all times. Following the 
definition of the classical CVRP, we do not permit split-deliveries, and therefore qu < C Vi, t. 

At each customer i, a demand du is to be satisfied for each period t. Inventory levels Ljt at the 
customers are limited to a maximum amount of Qi, i. e. La < Qi^i, t. An incoming material flow 
(f)'l^ and an outgoing material flow (/)^ links the inventory levels at the customers over the time horizon: 
Lit+i = Lit + (pf^ - 4>^^ yi,t = I, . . . ,T - 1- The incoming material flow is given by cpf^ = qu iff qu < 
Qi — Lit^i, and 0^ = Qi — otherwise. On the one hand, this implies that the maximum inventory 
levels are never exceeded, independent from the choice of the shipping quantities qu. On the other hand, 
delivery quantities qu with qu > Qi — Ln-i can be excluded when solving the problem. Obviously, 
as all inventory levels La > 0, any delivery quantity qu > Qi will not play a role also. The outgoing 
material flow assumes = da iff da < Ln-i + (/>J, and (pu = ^it-i + (pft otherwise. The latter case 
is also referred to a stockout-situation, and the stockout-level can be computed as dn — Ln-i — qu- As 
we however assume da to be known in advance, we can easily avoid such situations by shipping enough 
goods in advance or just in time. 

Two objective functions are considered, leading to a biobjective formulation. First, we minimize the 
total inventory as given in Expression ([T|l. Besides, the minimization of the total distances traveled by 
the vehicles for shipping the quantities in each period is of interest. While the solution of this second 
objective as such presents an MV-haid problem, we denote this second objective with Expression (|2]|. 

n T 

min Lit (1) 

i=l t=l 
T 

min Y ^^^t (qu,- ■ ■ ,qnt) (2) 
t=i 

The two objectives are clearly in conflict to each other. While large shipping quantities allow a 
minimization of the routes, small quantities lead to low inventory levels over time. In between, we can 
expect numerous compromise solutions with intermediate values. 

Without any additional information or artificial assumptions about the tradeoff between the two func- 
tions, no sensible aggregation into an overall evaluation is possible. Consequently the solution of the 
problem at hand lies in the identification of all optimal solutions in the sense of Pareto-optimality, which 
constitute the Pareto-set P. 

3 Solution representation and heuristic search for the IRP 

The solution approach of this article is based on the decomposition of the problem into two phases. First, 
quantity decisions of when and how much to ship to each customer are made. This is done such that only 
customers running out of stock are served, i. e. we set qit > iff dit > Lit-i Vi, t. Second, the resulting 
CVRP is solved for each period t by means of an appropriate solution approach. 

Solving the IRP in this sequential manner has several advantages. Most importantly, we are able 
to introduce a compact encoding for the first phase, which allows an intuitive illustration of how the 
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results of the procedure are obtained. We believe this to be an important aspect especially for decision 
makers from the industry. Besides, searching the encoding is rather easy by means of local search/ 
metaheuristics. 

3.1 Solution encoding 

Alternatives are encoded by a n-dimensional vector vr = (vri, . . . , tt„) of integers. Each element tTj of the 
vector corresponds to a particular customer i, and encodes for how many periods the customer is served 
in a row (covered). In the following, we refer to these values vTj as a customer delivery 'frequency'. 
Precisely, if du > Lu-i, then qu may be computed as given in Expression 



One the one hand, a day-to-day delivery policy is obtained for values of vTj = 1. On the other hand, 
large values vTj ship up to the maximum of the customer/ vehicle capacity, and intermediate values lead 
to compromise quantities in between the two extremes. 

On the basis of the delivery quantities, and as mentioned above, VRPs are obtained for each period 
which do not differ from what is known in the scientific literature. 

3.2 Heuristic search 

Throughout the optimization runs, an archive P of nondominated solutions is kept that presents a heuris- 
tic approximation to the true Pareto-set P. By means of local search, and starting from some initial 
alternatives, we then aim to get closer to P, updating P such that dominated solutions are removed, and 
newly discovered nondominated ones are added. 

In theory, any local search strategy that modifies the frequency values of vector vr might be used for 
finding better solutions. Clearly, values tTj < 1 are not possible, and exceptional large values do not 
make sense either due to the upper bounds on the delivery quantities as given in Expression ([3]l. 

3.2.1 Construction procedure 

First, we generate solutions for which the delivery frequency assumes identical values for all customers, 
starting with 1 and increasing the frequency in steps of 1, up to the point where the alternative cannot 
be added to P any more. Then, we construct alternatives for which the frequency values are randomly 
drawn between two values: j and j + 1, thus mixing the frequency values of the first phase and therefore 
computing some solutions in between the ones with purely identical frequencies. The so obtained values 
of TT are then used to determine the delivery quantities as described above. 

The vehicle routing problems are then solved using two alternative algorithms. On the one hand, 
a classical savings heuristic |6l is employed, which allows for a comparable fast construction of the 
required routes/ tours. On the other hand, the more advanced record-to-record travel algorithm ||9l is 
used. Having two alternative approaches is particularly interesting with respect to the practical use of 
our system. While the first algorithm will allow for a fast estimation of the routing costs, improved 
results are possible by means of the second approach, however at the cost of more time-consuming 
computations. 

3.2.2 Improvement procedure 

The improvement procedure in this article is based on modifying the values within vr, and re-solving the 
subsequently modified VRPs using the algorithms mentioned above. For any element of P, we compute 
the set of neighboring solutions by modifying the values vTj by ±1 while avoiding values of vTj < 1. 
Therefore the maximum number of neighborhood solutions is 2n. The search for improved solution 




(3) 
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naturally terminates in case of not being able to improve any element of P. In this sense, the approach 
implements the concept of a multi-point hillclimber for Pareto-optimization. 

Contrary to searching all elements of P, a representative smaller subset of P must be taken for the 
improvement procedure. This reduces the computational effort, which we have been able to observe in 
our previous investigations [8], to a considerable extent. One possibility is to assume a set of reference 
points, and to select the elements in P that minimize the distances to these reference points. As a 
general guideline, the reference points should be chosen such that the solutions at the extreme ends of 
the two objective functions ^ and (O are present in the subset. A favorable implementation of the 
distance function lies in the use of the weighted Chebychev distance metric, as it allows for a selection of 
convex-dominated alternatives and thus provides some theoretical advantages over other approaches {VT\. 
Figure[T]illustrates the general use of reference points for the problem at hand. A more detailed discussion 
of the principle follows in section 14.21 




V u • ^ 

^ 

Normalized Inventory Cost 



Figure 1: The reference point (diamond), directions of search and selected solutions (black dots) 



4 Experimental investigation 
4.1 Proposition of benchmark data 

Using the geographical information of the 14 benchmark instances given in chapter 11 of fS^, new data 
sets for inventory routing have been proposed. While the classical VRP commonly lacks multiple-period 
demand data, we filled this gap by proposing three demand scenarios, each of which contains a total of 
T = 240 demand periods for all customers. 

Scenario a: The average demand of each customer is constant over time. Actual (integer) demand 
values for each period however are drawn from an interval of ±25% around this average. 

Scenario b: We assume an increasing average demand, doubling from the initial value at t = 1 to 
t = 240. Again, the actual demand is drawn from an interval of ±25% around this average. 

Scenario c: In this scenario, the average demand doubles from t = Ito t = 120, and goes back to its 
initial value in t = 240, following the shape of a sinus curve. Identical to the above presented 
cases a deviation of ±25% around these values is allowed. 

The resulting set of 42 benchmark instances have been made available to the scientific community 
under |http : / /logistik . hsu-hh . de/ IRP| and documented in lITTI . 
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4.2 Selection of reference points and decision support provided by the method 

A test program has been developed and is used to test new and innovative strategies. Figure |2] shows a 
typical screen shot of our solver. The upper part on the left gives the name of the instance and the vehicle 
capacity. Then below, the decision maker is able to display the different alternatives computed by the 
software. 

For the current alternative, and the current period, a text window gives the inventory level, the number 
of vehicles used and the information on the tours. The box in the bottom left part represents the evolution 
of the total inventory over all periods. The large window on the right presents the current alternative and 
period routing. Green bars are the stock level at the customer location for the period chosen with the 
slide bar at the bottom of the screen. 
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Figure 2: Screen shot of the Inventory Routing Solver. 

The subset of alternatives that we select is part of the critical step. We use the above mentioned 
reference point technique that seems to be the most appropriate in that kind of situations. To make the 
technique consistent, the two objectives have been normalized. Figure [T] depicts a typical situation. The 
complete set of non-dominated solutions is represented by the black and white dots. The two extreme 
solutions are highlighted by circles at the top left comer and the bottom right comer. These two extreme 
solutions plus the "ideal point" (the solution circled in the bottom left corner of the figure) having the 
best value in terms of routing cost and inventory cost compose the three initial reference points. 

Then depending on the number of final altematives that the decision maker wants, we split the two 
axes (represented by the dotted line) in equal proportions. In the case presented in Figure [TJ the two axes 
have been split in 4, generating 3 additional reference points for the the vertical axe, and 3 additional 
reference points for the horizontal axe. These new points are represented in the figure by the white 
diamonds. With each of the 9 reference points, we will select in the complete solution set, the solution 
that minimizes the Chebyshef distance metric. This will give us at most 9 solutions that will be used in 
the sequel for improvement as mentioned in section [J!2l The direction of search is indicated in Figure [T] 
with the arrows. 
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4.3 Results and discussion 

The following Table[T]reports the computational results of four chosen instances/ scenarios. In total, three 
different numbers of reference points have been tested: 3, 5, and 11. The choice of the three settings is 
based on the general explanations given in Figure [T] Three reference points are the minimum possible 
(sensible) setting for this biobjective case, and represent an approach in which a maximal reduction 
of the computational effort can be expected. A slightly more refined approximation of the Pareto set 
can be expected in case of five reference points (+1 on each axis). In comparison, the assumption of 
significantly more (eleven) reference points should then contribute to the better understanding of the 
relationship of the computational effort to the obtained results. 

Obviously, the computational effort needed until converging towards local optima heavily depends 
on the number of reference points. Besides, the characteristics of the data sets influence the difficulty to 
a considerable degree. On the one hand, smaller instances are solved faster, which has to be expected, 
of course. On the other hand, the average demand data has an impact, with instances of scenario type 
'a' being solved fastest. When comparing scenarios 'b' and 'c', a clear distinction with respect to the 
difficulty cannot be made. In some cases, the scenario with the growing average demand turns out to 
be more demanding, while sometimes the Pareto-optimal solutions of scenario 'c' are more costly to 
approximate. 
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Table 1 : Results for the first four instances (all scenarios) and three different numbers of reference points 
(R=3, R=5, R=ll). The number of steps denotes the number of local search steps (neighborhoods) 
needed to reach a local optimum. Besides, the total number of evaluated alternatives is given, and the 
size of the obtained approximation P is reported. The computation time (CPU) has been recoded on a 
single core of an Intel Xeon X5550 processor, and is given in seconds. 



Overall, it is possible to see that the number of reference points can be used to modulate the compu- 
tational effort, independent from the particular size or other characteristics of the data sets. With respect 
to a practical application of our method, this presents a meaningful finding. In an interactive search and 
decision making approach, it allows for a fast first approximation involving only few reference points, 
followed by a refined search in a preferred region. Here, the direction of search, i. e. the preferred region 
in which additional solutions are to be generated, is given by the decision maker who interacts with the 
system by means of an appropriate multi-criteria decision aiding technique. 

Nevertheless, and especially for the large instances, the computing times are still very high. Clearly, 
this is due to the assumption of T = 240 demand periods. In combination with an increasing number of 
customers, the computing times grow up to the point where they present a challenge. 

We have also closely analyzed the obtained results in outcome space, depending on the assumed 
number of reference points. More precisely, we investigated whether the results of a particular reference 
point setting lead to dominating results over other settings. Figure [3] reports four exemplary results, 
which act on behalf of the tested 12 cases. 

The analysis shows that, for the problem at hand, even the outcomes obtained when assuming 3 ref- 
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Figure 3: Results when testing a different number of reference points. Top left: instance GS-0 1-a . irp, 
50 customers, constant average demand (scenario a); top right: instance GS-0 2-a . irp, 75 customers, 
constant average demand (scenario a); bottom left: instance GS-0 3-b . irp, 100 customers, increasing 
average demand; bottom right: instance GS-0 4-c . irp, 150 customers, average demand following a 
sinus function (scenario c). 



erence points are helpful to the decision maker. Obviously, the approximation is improved by increasing 
the number of reference points at the cost of additional computations. In Figure [3l this is shown by an 
increasing 'convexity' of the Pareto-front. Despite these refinements, the results of smaller reference 
point sets are however not dominated. Overall, and in combination with the above presented analysis 
of the running times, this implies that first approximations involving few reference points are not only 
comparably fast computable, but also acceptable with respect to the obtained quality. 
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5 Conclusions 

The article presented a study on the biobjective IRP, a generalization of the classical IRP with its ag- 
gregated objective function. Challenging data sets comprising T = 240 periods have been investigated, 
which refer to applications in which optimization has to be done over a mid-term planning horizon. 

A local search approach employing reference points has been presented, implemented, and tested on 
the given benchmark instances. The approach has been formulated in response to previous research [8J, 
which clearly showed the difficulty of identifying all Pareto-optimal solution within reasonable time. 

By means of reference points, we may modulate the computational effort and thus contribute to 
solving such large and computationally challenging problems. Besides, the experiments show that for 
the investigated problem, outcomes obtained by few reference points are not dominated by the ones of 
larger reference point sets. In conclusion, we may state that the proposed techniques presents a suitable 
approach for such biobjective Inventory Routing Problems, especially when being confronted with larger 
data sets. 

Despite this progress, some open issues exist. On the one hand, the computing times are, in some 
cases, still large and should be improved as part of our future work. On the other hand, more effec- 
tive local search strategies should be tested, i. e. better neighborhood structures that allow for a faster 
convergence. 
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